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Abstract 

G. Rauzy showed that the Tribonacci minimal subshift generated by the morphism r : 

I— > 01, 1 I— > 02 and 2 is measure-theoretically conjugate to an exchange of three fractal 
domains on a compact set in M?, each domain being translated by the same vector modulo a 
lattice. In this paper we study the Abelian complexity p{n) of the Tribonacci word t which 
is the unique fixed point of r. We show that p{n) G {3, 4, 5, 6, 7} for each n > 1, and that 
each of these five values is assumed. Our proof reUes on the fact that the Tribonacci word is 
2-balanced, i.e., for all factors U and F of t of equal length, and for every letter a G {0, 1, 2}, 
the number of occurrences of a in J7 and the number of occurrences of o in y differ by at most 
2. While this result is announced in several papers, to the best of our knowledge no proof of 
this fact has ever been published. We offer two very different proofs of the 2-balance property 
of t. The first uses the word combinatorial properties of the generating morphism, while the 
second exploits the spectral properties of the incidence matrix of r. 

1 Introduction 

Given a finite non-empty set A, called the alphabet, we denote by A* the free monoid gen- 
erated by A. The identity element of A* , called the empty word, will be denoted by e. For 
any word u = aia2 • • • a-n G A* , the length of u is the quantity n and is denoted by \u\. By 
convention, the length of the empty word e is taken to be 0. For each a ^ A,\Qi\u\a denote the 
number of occurrences of the letter a in -u. We denote by A^ the set of (right) infinite words 
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on the alphabet A. Given an infinite word uj = ujqluilu2 • • ■ ^ A , any finite word of the form 
cjjWi+i • • • uJiJ^n-i (with i > and n > 1) is called & factor of u. Let 

^u>{n) = {WjWi+l • • • Wj+n-l N > 0} 

denote the set of all factors of uj of length n, and set Puj{n) = Card(^^(n)). The function 
Po; : N ^ N is called the subword complexity function of lo. A fundamental result due to 
Hedlund and Morse [ 15 1 states that a word lo is ultimately periodic if and only if for some n 
the subword complexity Puj{n) < n. Words of subword complexity p{n) = n + 1 are called 
Sturmian words. The most well-known Sturmian word is the so-called Fibonacci word 

f = 01001010010010100101001001010010010100101001001010010 • • • 

fixed by the morphism 01 and 1 0. 

Sturmian words admit various types of characterizations of geometric and combinatorial 
nature. We give two examples: In [.16il Hedlund and Morse showed that each Sturmian word 
may be realized measure-theoretically by an irrational rotation on the circle. That is, every 
Sturmian word is obtained by coding the symbolic orbit of a point x on the circle (of circum- 
ference one) under a rotation by an irrational angle a where the circle is partitioned into two 
complementary intervals, one of length a and the other of length 1 — a. And conversely each 
such coding gives rise to a Sturmian word. 

Sturmian words are equally characterized by a certain balance property. We begin with the 
following definition: 

Definition 1.1. An infinite word uj G J^^ is said to he C-balanced (C a positive integer) if 
\W\a ~ \^\a\ ^ C for all factors U and V of oj of equal length, and each a A. We say u> is 
balanced if it is 1-balanced. 

Theorem 1.2 (Theorem 2.1.5 in llT3l ). An infinite word lj is Sturmian if and only if uj is a 
binary aperiodic balanced word. 

In lITTl fTFl. Rauzy showed that the regular continued fraction algorithm provides a formidable 
link between the arithmetic/diophantine properties of an irrational number a, the ergodic/dynamical 
properties of a rotation by angle a on the circle, and the combinatorial (balance) properties of 
Sturmian words. 

A fundamental problem is to generalize and extend this rich interaction to higher dimen- 
sions, either by starting with a specified symbolic dynamical system, or by a class of words 
satisfying particular combinatorial conditions. In case of dimension 2, there are a couple of dif- 
ferent dynamical systems which are natural candidates: For instance, promising results have 
been obtained by the third author together with Ferenczi and Holton by considering the dy- 
namics of 3-interval exchange transformations on the unit interval ||8l|9l[T0l[Ill, and also by 
Amoux, Berthe and Ito by considering two rotations on the circle O. However, ever since 
the early work of Rauzy in |[T9i . the most natural generalization was thought to be the one 
stemming from a rotation on the 2-torus. In this context, the associated symbolic counterpart 
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is given by a class of words of subword complexity 2n + 1, now called Amoux-Rauzy words, 
originally introduced by Arnoux and Rauzy in Q. In this setting, the Tribonacci word 



t = lim t"(0) = 01020100102010 ■ 

n^oo 

defined as the fixed point of the morphism r 

01 1 02 2 0, 



is the natural analogue of the Fibonacci word f . In fact, in 01911 Rauzy showed that the Tri- 
bonacci minimal subshift (the shift orbit closure of t) is a natural coding of a rotation on the 
2-dimensional torus T^, i.e., is measure-theoretically conjugate to an exchange of three fractal 
domains on a compact set in R^, each domain being translated by the same vector modulo a 
lattice. In ||6l, the third author together with Cassaigne and Ferenczi showed that there exist 
Arnoux-Rauzy words which are arbitrarily imbalanced, that is to say, which are not C-balanced 
for any positive integer C, and that such Arnoux-Rauzy words cannot be measure-theoretically 
conjugate to a rotation on the 2-torus. 
In the first part of this paper, we prove: 

Theorem 1.3. The Tribonacci word t is 2-balanced. 

While this result is announced in several papers (see for instance llHISl ID [121 EH), to 
the best of our knowledge no proof of this fact has ever been published. In this paper we 



give two different proofs of Theorem 1.3 The first is a proof by induction which uses the 
word combinatorial properties of the generating morphism r. The second proof relies on the 
spectral properties of the incidence matrix and is more in the spirit of the methods developed 
by Adamczewski in HI. 

Our hope is that one of the two proofs may be extended to establish a balance property for 
the general m-bonacci word (m > 2) defined as the fixed point of the morphism 

Oh^OI 1h^02 ••• (m - 2) 0(m - 1) (m - 1) 0. 

Computer simulations show that the 4-bonacci word is not 2-balanced. In fact, let u be the 
factor of the 4-bonacci word of length 3305 occurring in position 2663 (starting with 0), and v 
the factor of length 3305 occurring in position 9048. Then |ii|i = 891 and |f |i = 888, which 
shows that the 4-bonacci word is not 2-balanced. It is possible that in general the m-bonacci 
word is (m — l)-balanced as stated in [12]. 



We then apply Theorem 1.3 to study the so-called Abelian complexity of the Tribonacci 
word. Following [|20|| , two words u and v in A* are said to be Abelian equivalent, denoted 
u V, if and only if \u\a = \v\a for all a ^ A. For instance, the words ababa and aaabb 
are Abelian equivalent whereas they are not equivalent to aabbb. It is readily verified that 
defines an equivalence relation on A* . 

We define 
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and set = Card(^5^(n)). The function p = : N ^ N which counts the number of 
pairwise non Abelian equivalent factors of uj of length n is called the Abelian complexity or 



ab-complexity for short. Using Theorem 1.3 we compute the ab-complexity of the Tribonacci 
word t: 

Theorem 1.4. Let t denote the Tribonacci word. Then pt{n) E {3,4,5,6,7} for every positive 
integer n. Moreover, each of these five values is attained. 

We also obtain several equivalent characterizations of those values n for which p{n) = 3. 
As a consequence we show that p{n) = 3 for infinitely many n. Similarly we show that 
p(n) = 7 for infinitely many n, although the least n for which p{n) = 7 is n = 3914. We 
conclude with some open questions. 

Acknowledgements: The second author is partially supported by grant no. 8121419 
from the Finnish Academy. The third author is partially supported by grant no. 090038011 
from the Icelandic Research Fund. 



2 Two Proofs of Theorem 1.3 



2.1 Preliminaries to Proof 1 of Theorem 1L3 

Let r : {0, 1, 2} ^ {0, 1, 2}* denote the morphism 

01 1 02 2 0, 

and let 

t = lim t"(0) = 01020100102010 • • • 

71— >00 

denote the Tribonacci word. 

For each u S {0, 1,2}*, we denote by ^{u) the Parikh vector associated to u, that is 

^'(m) = (|n|o, |n|i, |u|2). 

For each infinite word uj G {0, 1,2}^ let ^u){n) denote the set of Parikh vectors of factors of 
length n of a;: 

= {^{u)\u£ J'M}. 

Thus we have 

pM =Card(^'^(n)). 
We will need the following two lemmas. 

Lemma 2.1. Let U be a non-empty factor oft. Then there exists a factor u of t such that 
^'(C/) = + 5, \u\q, \u\i) for some 5 G {-1, 0, 1}. Moreover if \U\ > 3 then \u\ < \U\. 
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Proof. It is readily seen that every non-empty factor ?7 of t may be written as either U = t{u), 
or U = 0~^r(n), or U = t{u)0, or U = 0~^t{u)0 for some factor u of t. Moreover, 
|r('u)|o = \u\, \t{u)\i = \u\o, and |r(u)|2 = Thus we have that 

*([/) = (|n| + (5,|'u|o,|u|i) 

for some 6 G {— 1,0,1}. Moreover, from above it follows that \U\ > \t{u) \ — 1. Also, for all 
u we have \t{u)\ > \u\, and if \u\ > 3, then \t{u)\ > |tt| + 2 since either u contains at least 
two occurrences of 0, or u contains both an occurrence of and an occurrence of 1. Finally, to 
see that |u| < \U\ whenever \U\ > 3, we suppose to the contrary that |n| > \U\ > 3; then 

> \t{u)\ - 1 > |it| + 1 > \u\, 

which is a contradiction. 

□ 

Note: In what follows, we will often apply the above lemma to a pair of words U and V where 
l^lo = l^^lo + 2 and \U\i = \V\i + 3 for some i € {1, 2}. Note that under these hypotheses, 
\U\>7 since U contains either at least 3 occurrences of 1, or at least 3 occurrences of 2. Since 
every 1 and 2 occurring in t is always preceded by 0, we deduce that |?7|o > 2, and hence 
|F|>4. 

Lemma 2.2. Let U and V be factors oft satisfying the following two conditions: 
\V\o = \U\o + 2 

\U\i = \V\i + 3 

for some i G {1, 2}. Then there exist factors uandv oft, with \u\ < \U\, \v\ < \V\, \u\ < \v\, 
and \u\i-i = \v\i-i + 3. 

Proof. By the previous lemma there exist factors u, v with \u\ < \U\, \v\ < \V\, such that 

*([/) = (|n| + 5i,|n|o,|u|i) and ^{V) = {\v\ + S2,\v\q,\v\i) 
for some 61,62 G {—1, 0, 1}. By hypothesis we have 

l^^l + ^2 = \V\o = \U\o + 2=\u\ + 6i + 2, 

whence 

|u| = + ^2 — — 2 < \v\. 
Finally, the condition \U\i = \V\i + 3 for i G {1,2} implies |n|i_i = l^li-i + 3. 

□ 

In what follows, we will make use the following terminology: We say two finite words U 
and V on the alphabet {0, 1,2} are pairwise 2-imbalanced if \U\ = \V\ and \ U\i - \V\i\ > 3 
for some i G {0, 1,2}. 
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2.2 Proof 1 of Theorem O] 

Suppose to the contrary that t is not 2-balanced. Then there exists a shortest pair of factors U 
and V with \U\ = \V\ and \U\i - \V\i > 3 for some i G {0, 1, 2}. If \U\i - \V\i > 3, then 
by removing the last letter from each of U and V, we would obtain a shorter pair of words of 
equal length which are pairwise 2-imbalanced. Thus the minimality condition on \ U\ implies 
that \U\i — \ V\i = 3. Also, it is easily checked that all factors of length less or equal to 3 are 
2-balanced whence \ U\ > 4. We consider three cases: i = 0, then i = 2, and finally i = 1. 



Case 1: \U\o — \V\o = 3. By Lemma 2.1 there exist factors u,v with \u\ < \U\, \v\ < \V\, 
such that 

^'(C/) = (|^/| +(51,1^10,1^11) and ^'(y) = + ^2, |w|o, |v|i) 
for some Si, 62 G {—1, 0, 1}. The condition \U\o = \V\o + 3 implies that 

\u\ + 61 = \v\ + 62 + 3 

that is 

1^1 - |t;| = (3 + - Si) > 0. 

The condition \ U\ = \ V\ implies that 

\u\ + 61 + \u\o + \u\i = \v\ + S2 + \v\o + |f |l 

or equivalently 

2\u\ + 61 - \u\2 = 2\v\ +52- \v\2. 

Thus 

\u\2 - \v\2 = 2{\u\ -\v\) + 6i-62 = 2(3 + §2 - Si) + 61 - 62 = 6 + 82 - Si. 

Let n' be the prefix of M of length I u| — (3 + ^2 — (^i) = |f|.Then, \u'\2 > 1^12 + 3 contradicting 
the minimality of | ?7| . 



Case 2: |?7|2 — \V\2 = 3. By Lemma 2.1 there exist factors u,v with \u\ < \U\, \v\ < \V\, 
such that 

'^{U) = {\u\+Si,\u\oAu\i) and ^'(F) = + ^2, |w|o, |v|i) 

for some 5i, ^2 S Oi !}• The condition |f7|2 = \V\2 + 3 implies that \u\i = |f |i + 3. If 
|m| < \v\, then u and the prefix of v of length \u\ are pairwise 2-imbalanced and of length less 
than \ U\, contradicting the minimality of |?7|. Thus we can suppose that 

\u\ = \v\ + m for some m > 1. 

If m > 2, then we deduce that 

\U\o = \u\ + Si > \u\ - 1 > \u\ - (m - 1) = + 1 > \v\ + S2 = \V\o. 
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As \U\2 = \V\2 + 3, \U\o > \V\o, and \U\ = \V\, it follows that \V\i > \U\i + 3, that is 
l^^lo ^ l^^lo + 3. But then v and the prefix of u of length \v\ are pairwise 2-imbalanced and of 
length less than \ U\, contradicting the minimality of |C/|. 

Thus we can suppose m = 1, that is \u\ = \v\ + 1. Again, if \U\o > \V\o, as above we 
would deduce that |t;|o > |u|o + 3 which would give rise to a contradiction. So we must have 
that \U\o < \V\o. This gives 



\v\ + l + 6i = \u\ +5i = \U\o< \V\o = \v\ + 62 



that is 



which in turn implies that 



Thus 



1 + -5l < ^2 
Si = —1 and 82 = I- 



^'(C/) = (l-ul - l,|?i|o,|u|i) and "^{V) = {\v\ + l,\v\o,\v\i). 
Finally, the conditions \ U\ = \ V\ together with |n| = + 1, and |n|i = \v\i +3 imply that 



l^lo = \v\o + {\v\ + 1 + It'll) - (I 

= 1^1 - (bl + 1 + l^^ll) 

= \V\ - {\u\ + \u\i -3) 
= \V\ - (|u| + |n|i - 1 - 2) 
= |y|-(|[/|-|n|o-2) 
= \u\o + 2. 



v\ + 1 + \v\ 



In summary we have: |n|i = |w|i + 3 and |t>|o = |ti|o + 2. Thus we can apply Lemma 2.2 
which gives a contradiction to the minimality of |C/|. 



Case 3: \U\i — \V\i = 3. By Lemma 2.1 there exist factors u,v with |u| < |?7|, \v\ < \V\, 
such that 



-^{U) = {\u\+6i,\u\o,\u\i) and "^{V) = {\v\ + 62,\v\oAv\i) 

for some 61, 62 G {—1, 0, 1}. The condition \U\i = |y|i + 3 implies that \u\o = \v\o + 3. 

If |m| < I I, then u and the prefix of of length I n| are pairwise 2-imbalanced and of length 
less than \ U\, contradicting the minimality of |C/|. Thus we can suppose that 

|m| = \v\ + m for some m > 1. 

Next we proceed like in Case 2. If m > 2, then we deduce that 

\U\o = \u\ +5i>\u\-l> \u\ - (m - 1) = |i;| + 1 > |t;| +82 = \V\o. 
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As |;7|i = \V\i + 3, \U\o > \V\o, and \U\ = \V\, it follows that \V\2 > \U\2 + 3, that is 
l^^li > l^li + 3. But then v and the prefix of u of length \v\ are pairwise 2-imbalanced and of 
length less than \ U\, contradicting the minimality of |C/|. 

Thus we can suppose m = 1, that is \u\ = \v\ + 1. Again, if \U\o > \V\o, as above we 
would deduce that > \u\i + 3 which would give rise to a contradiction. So we must have 
that \U\o < \V\o. This gives 

\v\ + l + 5i = \u\ +Si = \U\o< \V\o = \v\ + 52 

that is 



which in turn implies that 
Thus 



l + 6i<S2 
6i = —1 and 62 = 1- 



^{U) = {\u\-l,\u\o,\u\i) and ^{V) = {\v\ + l,\v\o,\v\i). 
The conditions \U\ = \V\ together with \u\ = \v\ + 1, and \u\q = \v\q + 3 imply that 

l^^li = \v\i + {\v\ + 1 + \v\o) - {\v\ + 1 + |f |o) 

= \V\-{\v\ + l + \v\o) 

= \V\-{\u\ + \u\o-3) 

= \V\-{\u\ + \u\o-l-2) 

= \V\-m-Hi-2) 
= \u\i+2. 

In summary we have 

\u\ = \v\ + 1 and = |u|i + 2 and |ti|o = |f |o + 3. 



Since \u\, \v\ > 3, Lemma 2.1 implies that there exist factors u' , v' with |n'| < \u\, \v'\ < \v\, 
such that 

^{u) = {\u'\ + 5[,\u'\o,\u'\i) and ^{v) = {\v'\ + 6'2,\v'\o,\v'\i) 

for some 6[, 62 G {-1, 0, 1}. 

As |u|i = |m|i + 2 we deduce that \ v'\o = |ti'|o + 2. Similarly, the condition \u\o = |f |o + 3 
implies that 

\u'\+6[ = \v'\+6'2 + 3 

so that 

= \u\ + 5'i — 82 — 3 < \u\ — 1. 

Now if < \u'\ — 2, then if v" is any factor of the Tribonacci word of length \u'\ 
beginning in v' , we would have \v"\q > \v'\o + 1 = |n'|o + 3 as every factor of length 2 or 
greater contains at least one occurrence of the letter 0. 
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So we can assume that = |m'| — 1. Then 

It^'l + 1 + 6[ = \u'\ + 6'i = \u\o = \v\o + 3 = \v'\ + ^2 + 3 

that is 



which in turn implies that 



6[ -62 = 2, 
6[ = 1 and 62 = -1. 



So 



I 'I 
P 1 



Finally, 



u\i + {\v'\ + \v'\o) - {\v'\ + \v'\o) 

u'\i + {\v'\ + 1 + 1 + \v'\o - 2) - {\v'\ - 1 + \v'\o + 1) 

u'\i + {\u'\ + 1 + In'lo) - {\v'\ - 1 + \v'\q + 1) 

u'\i + {\u\ - \u'\) - {\v\ - \v'\i + 1) 

u\ — {\v\ — \v'\i + 1) 

u\ - {\v\ + 1) + \v'\i 

v'\i 



\u'\2 = \u'\2 + {\u'\o + - 1) - (|ti'|o + \u'\l - 1) 

= \u \ - 1 - (|n'|o + |n'|i - 1) 
= \v'\-{\v'\o-2 + \v'\i-l) 
= b'|2 + 3. 



So in summary we have 



\v lo 



|ii'lo + 2 and lii'b = k'b + 3 



to which we can apply Lemma 2.2 to obtain the desired contradiction. 



2.3 Preliminaries to Proof 2 of Theorem 1.3 



For the results in this section, see [14, chapter 10]. 

We write t = uqUiU2 • • • with Uj G {0, 1,2}. The Tribonacci numbers Tfc are defined by 
n = |r^(0)|. Hence n = T^-i + Tk-2 + Tk^3 for A; > 3, and 



To = l, T, = 2, T2 = 4, T3 = 7, r4 = 13, 



Let 



M 



/l 1 1^ 
1 
\0 1 0, 
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denote the incidence matrix of the morphism r, i.e., Mij = \T{j)\i for all i,j £ {0, 1,2}. The 



eigenvalues of M are the roots of the polynomial 



X 



X — 1. We denote them 13, a, and 



a so that f3 is real and a and a are complex conjugates. (Given a complex number z = a + bi, 
we denote hy z = a — bi the complex conjugate.) We have 



/3 = 1.83928... 



and 



a 



a 



0.73735 . . . 



More information concerning the roots a and (3 may be found in the comments of se- 
quences A058265 and A000073 in "The On-Line Encyclopedia of Integer Sequences". In 
particular, some formulas relating a and a to /3 are recalled. 
It is well-known that the frequencies of letters in t exist and 



Freqt(0) = -^, Freqjl) = Freqt(2) = 



3 ■ 



Set 























\ , Va = 

















Then Vj3,Va,Va are eigenvectors of M corresponding to (3, a, a, respectively, normalized so 
that in each case the sum of coordinates equals 1. Let us denote by ap,aa, CLa the complex 
numbers for which 

ai3Vi3 + QaVa + ttaVa = 

The numbers ap,aa and are unique, and therefore 

CLa = CLa and \aa\ = \aa\ = 0.14135 

Denote by 











(o 


ei = 


, 62 = 


1 


















Then we can represent letter frequencies as Freqt(i) =<vp,ei> for each letter i = 0,1,2, 
where <•, •> denotes the Hermitian scalar product. 



2.4 Proof 2 of Theorem 1.3 



Every natural number N has a unique Zeckendorff Tnhonacci representation 



N 



y^PkTk, 



k>0 



where pk € {0, 1}, all but finitely many pk equal 0, and if pk = Pk-i = 1» then pk~2 = 0. 
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It can be shown (see flT, Proposition 10.7.4 on page 510]) that for each letter i e {0, 1, 2}, 

\uo---UN^i\i = afs <vi3,ei> '^PkP'' + <Va,ei> ^^pua^ + <Va,ei> '^pka'', 

k>0 k>0 k>0 

and 

k>0 k>0 k>0 

These two identities imply 

|uo • • • liTV-ili - N <Vfj, ei> = aa <Va - v/3, ei> ^^PkO^ + an <Va - v/3, ei> ^^PkO^ 

k>0 k>0 

= 2Re(aa <Va - Vf3,ei> ^Pka''), 

k>0 

fc>0 

(1) 

where Re(-) denotes the real part of a complex number. The second equality above holds 
because complex conjugation commutes with multiplication and addition. 

The following lemma shows that in order to prove Theorem 1.3 it suffices to determine 
adequate upper and lower bounds for the above equation ([T]): 

Lemma 2.3. Let x = xoXiX2 ■ ■ ■ be an infinite word on a finite alphabet yl, and b £ Aa letter 
occurring in x. If there exist real numbers A, B and jfor which 

A < \xoXi ■ ■ ■ XN-l\b — < B 

for all integers N > 1, then the word x is [2{B — A)\-balanced with respect to the letter b, 
i.e., 

\\U\,-\Vk\<[2{B-A)\ 
for all factors U, V of equal length. 
Proof. Let i, j > be integers. Since 

\xiXi+i ■ --Xi+N^ilb - iV7 = (\xoXi ■ --Xi+N-ilb - {N + i)-f^ - (^\xoXi ■ ■ ■ Xi-i\b - ij^, 
we see that 

A - B < \xiXi+i ■ ■ ■ Xi+N-i\b - N-f < B - A. (2) 

Consequently, 

' <{B-A)-{A-B) = 2{B- A). 



\XiXi+i ■ ■ ■ Xi+N-l\b — \XjXj+i ■ ■ ■ Xj^N-l\b 



□ 
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We now determine good upper and lower bounds for Eq. ([T]l. The numerical calculations 
in Propositions 2.4 2.5 and 2.6 were verified independently by the first and second authors via 
different computer programs. 



Proposition 2.4. We have 



Proof. By ([T]l, we have 



-0.6 < \uqU\ ■ ■ ■ UAr_i|o — ^ < 0.9. 

p 



\U0 ■ ■ -UN-llo 



N 



k>0 



a (3 



Set 



Si = ^ 2pfcRefaa( 



0<fc<7 

Then we have 



1 1. 



Sr = y^2pfc Re(aa( 



fc>8 



1 1 



la 



N 



Sl — \Sr\ < Sl + Sr — \uo ■ ■ ■ Uat-iIo — ^ < Sl + \Sr\. 



(3) 



Choosing the pk so that pfc = 1 if and only if Re(aa(^ — is positive, and similarly for 

negative values, it can be seen that 

- 0.42 <Sl< 0.73. 
To estimate the magnitude of we compute 



1 1 

a (3 



1.72457., 



Thus 



1 1 

a (3 



|a|'' = 2\aa 



k>8 



\Sr\ < 2\aa 

The claim follows from inequalities Q, ([4]), and ([6]l. 
Proposition 2.5. We have 



1 1 

a (3 



\a\ 



1 — a 



< 0.17. 



(4) 

(5) 

(6) 

□ 



-0.775 < \uqUi ■ ■ ■ nAr_i|i — ^ < 0.725. 



N 



Proof. Analogously to the proof of Prop. 2.4 

1 1 . 



set 



Sl= ^ 2pkRe(aa{ 



0<k<10 



Sr= 2pkRe(aa{ 



k>ll 



1 1 
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Again, by choosing the pk appropriately, it can be verified that 

-0.70 <Sl< 0.65. 
To find an upper bound for |S'ij|, we compute 

^--^ = 1.96298... 

Therefore, 



\Sr\ < 2|a«| 



1 1 



\a\'' = 2\ao 



k>ll 



1 1 



a 



11 



1 — lal 



< 0.075. 



These inequalities imply the claim. 
Proposition 2.6. We have 



-0.88 < \uoUi ■ ■ ■ UN-i\2 - ^ < 0-62. 



Proof. Once again we set 

Sl= ^ 2pkRe(^aa{ 

0<fc<13 

As above, it can be verified that 
We compute 



1 1 

a3 ~ /33 



fc>14 



1 1 



ja 



-0.8371 <Sl< 0.5764. 



1 1 



2.33887..., 



and thus 



\Sr\ < 2|a, 



1 1 

" /33 



fc>14 



1 1 



a 



14 



1 — lal 



< 0.0354. 



The claim follows from these inequalities. 

Propositions 2.4 -2.6 together with Lemma [23] imply that t is 2-balanced. 



Remark 2.7. None of the inequalities in the claims of Propositions 2.4 -2.6 are optimal. 



(7) 



□ 



(8) 



□ 



3 Abelian complexity 



In this section we study the Abelian complexity of the Tribonacci word and prove Theorem 1 .4 
We begin by recalling the following basic fact from |[20l : 

Fact 3.1. If an infinite word uj has two factors u and v of the same length nfor which the ith 
entry of the Parikh vector are p and p+c respectively, then for each i = 0, . . . ,c, there exists a 
factor ug ofu of length n such that the ith entry of^(ue) is equal to p + £. 

Henceforth, we denote p the Abelian complexity of the Tribonacci word t . 
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(i+2,j, k-1) (i+2,j-l,k) 



i+l,j+l,k-l) (i+l,j,k) (i+l,j-l,k+l) 



(i,j+2,k-l) (i,j+l,k) (i,j,k+l) (i,j-l,k+2) 



(i-l,j+2,k) (i-l,j+l,k+l) (i-l,j,k+2) 



Figure 1 : Links between Parikh vectors 



3.1 Proof of Theorem [L4] 

Let us recall that a factor u of an infinite word lo is right (resp. left) special if there exist 
distinct letters a and b such that the words ua and ub (resp. au, bu) are both factors of uj. A 
factor which is both left and right special is called bispecial. 

It is well-known that for every n > 1, t has exactly one right special factor of length 
n — 1, and that, for this special factor that we denote t^_i, the three words t^-i^' ^n-i^^ 
tn-i'^ ^^^^ factors of t of length n. Writing ^'(t^_^) = (i, j, k), where k are each 
non-negative integers, we define 

Central(n) = {{i + l,j,k), + l,k),{i,j,k + 1)}. 

We have 

Central(n) C ^t{n). (9) 

Given a vector 'v' = (a,/5,7), set ||1?|| = max(|a|, I7I). The set of vectors ~v such 
that 1 — 1?| I < 2 for all "u in Central(n) is given by the graph in Figure [T] (whose vertices 
are vectors, and where two vertices 'u are joined by an undirected edge if and only if 

1 11? — 1?| I = 1). 

Since t is 2-balanced, ^'t(^^) is a subset of this set of twelve vectors. Moreover for the 
same reason, we should have 1 — 1?| | < 2 for all "u, 1? in ^'t(^^). This implies that the only 
possibility for ^'t(n) is to be a subset of one of the three sets delimited by a regular hexagon 
in Figure[TJ or one of the three sets delimited by an equilateral triangle of base length 2. These 
sets have cardinalities 7 and 6 respectively showing that Puj{n) < 7. □ 

Remark 3.2. By computer simulation we find that 

(/o(n))„>i = 334344434444443444444444444345544444554444 . . . 
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In particular, the least n for which p{n) = 5 is for n = 30. We also found that the smallest n 
for which p{n) = 6 is n = 342, and the smallest n for which p{n) = 7 is n = 3914. The next 
four values of n for which p{n) = 7 are n = 4063, 4841, 4990, 7199. 



3.2 More on the Abelian complexity of the Tribonacci Word 

In this section, we characterize those n for which pt{n) = 3. 

We continue to adopt the notation t^^, ^{t^_i), Central(n), and introduced in the 



proof of Theorem 1.4 Let B{n) be the set of vectors v for which there is exactly one u in 
Central(n) with — =2: 

B{n) = {{i - 1, j + 1, A; + 1), (i + 1, j - 1, A; + 1), {i + l,j + l,k- 1)}. (10) 

Proposition 3.3. The following are equivalent: 

(1) For all factors v and w oft of length n, we have \ \v\a — \w\a\ < ^for all a G {0, 1,2}. 

(2) pt{n) = 3. 

(3) ^t{n)nB{n) = 0. 

(4) t contains a bispecial special factor of length n — 1. 

(5) n = l,orn= ^{Tm + Tm+2 - I) form > 0, where {Tm)m>o = 1,2,4,7, 13,24, . . . 
denotes the sequence of Tribonacci numbers. 

Proof. Equivalence (1) (2) is an immediate consequence of Eq. (|9]l. To see the equivalence 
between items (4) and (5), we note that the bispecial factors of t are precisely the pahndromic 
prefixes of t (see ||2T1). The lengths of these words are known to be 

2 ~ 2 

(see Corollary 3.1 1 in |7] or Corollary 3.3 in ll23i ). It follows that (4) 4^ (5). Thus it remains 
to prove that items (2), (3), and (4) are equivalent. We will make use of the following lemmas. 

Lemma 3.4. Let n > 1, and let v be a factor of t. Put ^^(t^^^) = (i, j, k), and '^{v) = 
{i' ,j' , k'). Suppose either 

• \v\'>n and either i' < i — 1 or j' < j — 1 

or 

• |f I < ^T-, and either i' > i + 2 or j' > j + 2. 
Then p{n) > 3. 

Proof. The first condition implies the existence of a factor v' of t of length n with either 
l^^'lo < z — 1 or If' 1 1 < J — 1. By (|9]l it follows that the factors of t of length n are not 
balanced. Similarly, the second condition implies the existence of a factor v' of t of length n 
with either 1 1;' | o > i + 2 or | 1 1 > j + 2. Again by ([9]) it follows that the factors of t of length 
n are not balanced. □ 
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For each n > 1, let 0(n) = |r(t<„i)0| + 1. 
Lemma 3.5. If p{n) = 3, then p{cl){n)) = 3. 

Proof. Assume p{n) = 3, and set N = 4>{n). Let ^'(t^_^) = (i, j, k), and t 
r(t<_^)0, it follows that 



< 

N~l 



Since t^_^ 



Now we have 



and 



I = n; J = i; K = j. 

^t{n) = {{i + k), + 1, k), k + 1)} 
{{I + 1,J,K),{I,J + 1,K),{I,J,K + 1)} C vI/t(Ar) 



{I,J,K). 



(11) 



and we must show that in fact we have equality in the previous containment. By Theorem 1.3 



the only other possible Parikh vectors of factors of t of length N are one of the following six 
vectors 

fl-1 
J 

\K + 2 

or one of the following three vectors 














J + i 


\k-i) 




[k + i) 



Using Lemma 3.4 we will show that if any one of these nine vectors belong to ^ti^i), then 
p{n) > 3, a contradiction. Hence p{N) = 3. We proceed one vector at a time. 



o Suppose {I — 1, J, K + 2) G ^'t(A^). By Lemma 2.1 there exists a factor v of t with 
q,(v) = {i',j',k') and(5 G {-1,0,1} such that 



I-l 
J 

K + 2 



\v\ + 6 



f 



It follows from (fTTll that \v\ = n — 1 — 5 < n, and j' = j + 2 which contradicts Lemma 3.4 



o Suppose (/ — 1, J + 2,K) G ^t(-^)- By Lemma 2.1 there exists a factor w of t with 
y^[v) = {i',j',k') and6e {-1,0,1} such that 



I-l 
J + 2 
K 



\v\ + 5 



] 



It follows from (fTTll that \v\ = n — 1 — 5 < n, and i' = i + 2 which contradicts Lemma 3.4 
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o Suppose {I, J — 1, K + 2) G ^'t(A^). By Lemma 2.1 there exists a factor w of t with 
qf(v) = {i',j',k') andJ G {-1,0,1} such that 



I =\v\+5 
J -I =i' 
K + 2 =j'. 



It follows from ( [TT] ) that \ v\ = n — 5<n + l,i' = i — l, and j' = j + 2 which contradicts 
Lemma [331 



o Suppose (/, J + 2,K — 1) G ^'t(A^). By Lemma 2.1 there exists a factor w of t with 
^i{v) = ii',j',k') and5 G {-1,0,1} such that 

I =\v\+6 
J + 2 =i' 
K-1 =f. 



It follows from ( ]_J_) that \ v\ = n — 6<n + l,i' = i + 2, and j' = j — 1 which contradicts 
Lemma [331 



o Suppose (/ + 2, J — l,i^) G ^'t(A^). By Lemma 2.1 there exists a factor t; of t with 
q,(v) = ii',j',k') and(5 G {-1,0,1} such that 

1 + 2 =\v\+5 
J -I =i' 
K =j'. 



It follows from ( [TT] ) that |?;| = n + 2 — 6 > n + 1, and i' = i — 1 which contradicts 
Lemma [331 



o Suppose {I + 2, J, K — 1) G ^'t(A^). By Lemma 2.1 there exists a factor u of t with 
^^f{v) = {i',j',k') andSe {-1,0,1} such that 

1 + 2 =\v\+6 

J =i' 
K-1 =j'. 



It follows from (l]_i that |f;| = n + 2 — 6 > n + 1, and j' = j — 1 which contradicts 
Lemma [331 

This concludes the first six cases. Now we consider the last three: 



o Suppose (I + 1,J + 1,K — 1) G ^'t(A^). By Lemma 2.1 there exists a factor w of t with 
qf(v) = {i',j',k') and5 G {-1,0,1} such that 

/+1 =\v\+5 
J+1 =i' 
K-1 =j'. 
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It follows from (fTTIl that = n + 1 — 6 > n, and j' = j — 1 which contradicts Lemma 3 .4 



o Suppose {I + 1,J — 1,K + 1) e ^'t(A^). By Lemma 2.1 there exists a factor w of t with 
qj(v) = ii',j',k') and5 G {-1,0,1} such that 

/+1 =\v\+6 
J -I =i' 
K + l =/. 



It follows from (fTTI) that \v\ = n + 1 — 5 > and i' = i — \ which contradicts Lemma 3.4 



o Suppose (/ — 1,J + 1,K + 1) G ^'t(A^). By Lemma 2.1 there exists a factor v oit with 
^ll{v) = ii',j',k') md6G {-1,0,1} such that 

I -I =\v\+5 
J+1 =i' 
K + l =/. 



It follows from ( j_J_i that \v\=i' + j' + k' = n — 1 — 5 = i + j + k — 6,i' = i + l, and 
/ = j + 1. Thus we have k' = k - 2 - 6. If 5 = -1, then (i + 1, j + 1, A: - 1) G ^'t(ra) 
contradicting that p(n) = 3. If 5 = 0, then |?;| = n — 1 and k' = k — 2. Thus any factor of 
t of length n beginning in v will have at most k — 1 many 2's, contradicting that the balance 
between any two factors of length n is at most one (there is a factor of length n with Parikh 
vector k + 1)). If 6 = 1, then \v\ = n — 2 and k' = k — 3. Thus any factor of length n 



beginning in v will have at most k — 2 many 2's, contradicting Theorem 1.3 



This concludes the proof of the lemma. □ 
Lemma 3.6. //^'t(«) n B{n) / 0, then ^t{Hn)) n B{(l){n)) / 0. 

Proof. Since ^'(t^_^) = (i, j, k), we have that 4>{n) = n + i + j + 1. Following ( [T0| , if 
'^{v) = (i — 1, J + 1, /c + 1) for some factor w of t of length n, then ^'(r(t;)0) = (n + 
l,i - l,j + 1) G ^t('^(?^)) n B{(j){n)). If ^(w) = (i + l,i - l,fc + 1) for some factor v 
of t of length n, then ^'(r(f)O) = (n + l,i + l,i - 1) G ^'t((/>(n)) n B{(j){n)). Finally 
if ^'(t;) = (i + 1, j + 1, A; — 1) for some factor of t of length n, then \I'(0~^r(f)) = 
(n-l,i + l,j + l) G ^t{(|>{n))f^B{c|){n)). □ 

Lemma 3.7. If tin) n = 0, tj^_^ w a bispecial factor of t . 

Proof. We proceed by induction on n. The result is clear forn = 1, 2. Let N > 2, and suppose 
the resuh is true for all n < N. Assume ^'t(iV) n B{N) = 0. Set ^'(t< J = (/, J,K). 
Since is right special, it ends to 0. We now show that t^_^ also begins in 0. Suppose 
t^_^ begins in 1. Then, 1^H^_^20 is a factor of t of length N, and ^'(1"H^_^20) = 
(/ + 1, J — 1, + 1) G "i>t{N) n B{N) contrary to our assumption. Similarly, assume 
t^„i begins in 2. Then, 2"H^_-^10 is a factor of t of length N, and ^'(2"H^_-^10) = 
(/ + l,J + l,-ftr-l) G ^t(^^) n B{N) contrary to our assumption. 
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Having established that begins in 0, we can write = t(u)0 for some factor u 
of t. Also, as t^_-|^ is right special, so is u and hence u = t^_^ for some n, and so = (/>(n). 

It follows from the previous lemma that ^'t('^) H B{n) = 0. Hence by induction hypothesis 

< 

N-l- 

□ 



we have that t^_^ is bispecial, and hence so is r(t^_^)0 = 



We are now ready to establish the remaining equivalences in |3.3[ We will show that (3) ^ 
(4) =^ (2) =^> (3). The previous lemma states precisely that (3) (4). That (2) =^> (3) is clear 
from (|9]l and ( [T0| . Finally to see that (4) (2), we proceed by induction on n. The result is 
clear for n = 1,2. Let N > 2 and suppose the result is true for all n < N. Assume t^ ^^ is 
bispecial. Then as t^_^ begins and ends in 0, we can write = r(u)0 for some factor u 
of t. Moreover as is bispecial, so is u. It follows that u = t^_i for some n, and hence 



N = 4>{n). By induction hypothesis we have that p{n) = 3. It now follows from Lemma 3.5 



that p{N) = 3 as required. This completes our proof of Proposition p3l □ 



As an immediate consequence of Proposition |3.3[ we have that p(n) = 3 for infinitely 
many values of n. We next show that the same is true for p{n) = 7. 

Proposition 3.8. The Abelian complexity of the Tribonacci word attains the value 7 infinitely 
often. 

Proof. Note that t begins with the square (0102010)^ = r^(OO) and so with the squares 
^n+s^QQ-j Consider any integer n > for which all factors of length m = 3914 occur in 
^n+3|-Q-j recall that a computer computation showed that /9(3914) = 7). For any factor y 
of length m there exists a conjugate x of r"+'^(0) such that xy is a factor of t"+3 (00). Since 
^(x) = ^'(r"'+^(0)) and since "^{xy) = ^'(x) + ^'(y), we deduce that Card^'t(|r"+^(0)| + 
m) > Card^'t (m) = 7. Since the Abelian complexity of the Tribonacci word is bounded by 
7, we get Card^'t(|r"+3(o)| + m) = 7. □ 



4 Conclusion 

Various aspects of the Abelian complexity of the Tribonacci word remain a mystery. For 
instance, it is surprising to us that the value pt{n) = 7 does not occur until n = 3914, but then 
re-occurs relatively shortly thereafter. As another example, it is verified that for all n < 184, 
if U and V are factors of t of length n, with U a prefix of t, then | |?7|a — I^UI < 1, for all 
a G {0, 1, 2}. But then this property fails for n = 185. We list a few related open questions: 

Open problem 1. Does the Abelian complexity of the Tribonacci word attain each value in 
{4, 5, 6} infinitely often? 

Open problem 2. For each value m G {4, 5, 6, 7}, characterize those n for which p{n) = m. 

Another problem concerns the m-bonacci word, the generalization of the Tribonacci word 
(and of the celebrated Fibonacci word) to an alphabet on m letters. This word is defined 
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as the fixed point of the morphism : {0, . . . , m — 1}* {0, . . . , m — 1}* defined by 

r„(i) = 0(z + 1) for < i < m - 2 and r(m - 1) = 0. 

Open problem 3. Prove or disprove that the m-bonacci word is (m — l)-balanced. 
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